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Abstract. The purpose of this note is twofold. First, we survey the study of the percolation phase tran- 
sition on the Hamming hypercube {0, l} m obtained in the series of papers GD ED HH ED . Secondly, we 
explain how this study can be performed without the use of the so-called "lace-expansion" technique. To 
that aim, we provide a novel simple proof that the triangle condition holds at the critical probability. We 
hope that some of these techniques will be useful to obtain non-perturbative proofs in the analogous, 
yet much more difficult study on high- dimensional tori. 



1. Introduction 



This paper is intended to be a companion to the papers |9j [QJJ dH |24) in the setting of percola- 
tion on the Hamming hypercube {0, \} m . Our goal here is to present the most recent state of affairs 
of this topic, emphasizing ideas, techniques and the gaps in our understanding. We will present one 
novel proof of a result obtained in [10], namely, that the triangle condition on the hypercube holds 
^ ■ at the critical probability This proof is simpler than the one presented in [TO] as it does not use the 
lace-expansion technique. Given this estimate, it will be indicated in this note how the study of the 
qualitative properties of the phase transition on the hypercube can be obtained withoutthe use of the 
lace expansion. While this has yielded a non-perturbative proof in the hypercube setting, "unlacing" 
the proofs in the setting of high-dimensional tori seems much more difficult and requires new ideas. 
^ . We hope that the hypercube study will get us closer to this goal. 



The paper is organized as follows. In Section fTTTl we describe the phase transition in the Erdos and 
Renyi random graph, our main source of inspiration and in Section [L2l we describe the analogous 
results, obtained in |9j[T0j|24], in the setting of the hypercube. We proceed in Section fOl to introduce 
(S) ! and discuss the role of the so-called triangle condition for percolation. This condition arises fairly 
naturally in the study of percolation and to exemplify this, we present a classical argument of Barsky 
and Aizenman |4| showing how to control the expected cluster size using the triangle condition. Next, 
in Section fOl we present some conditions about the behavior of the random walk on the underlying 
graph and state a general theorem that allows us to analyze the phase transition in percolation on 
any graph satisfying these conditions. In Section [T31 we restrict our attention back to the hypercube 
setting and verify that the random walk conditions of the previous section holds. In particular, we 
state there the required estimate on p c (Theorem l 1.61 1 which we prove in this paper. We conclude this 
chapter in Section [L6l by discussing open problems. 

Section|2]contains an overview of the proof of Theorem l 1 .3l and Section|3]provides a proof of Theo- 
rem ll.6l which together with the argument in Section lZ3l yields a simple proof that the triangle condi- 
tion holds on the hypercube. 

1.1. The Erdos and Renyi random graph. Recall that G{n, p) is obtained from the complete graph by 
retaining each edge of the complete graph on n vertices with probability p and erasing it otherwise, 
independently for all edges. Write ^ for the y'-th largest component obtained this way. An inspiring 
discovery of Erdos and Renyi (T5) is that this model exhibits a phase transition when p is scaled like 
p = cln. When c < 1 we have 1^1 = 0(logn) whp and 1^1 = &{n) whp when c> 1. Here, we say that 
a sequence of events E n occurs with high probability (whp) when lim^oo P[E n ) = 1. We further write 
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f{m) = 0{g{m)) if |/(ra)|/|g(ra)| is uniformly bounded from above by a positive constant, f(m) = 
0(g(ra)) if ftm) = 0(g(m)) and g(ra) = 0(/(ra)), f(m) = H(g(ra)) if l//(m) = 0(l/g(ra)) and f(m) = 
o(g(m)) if f{m)l g[m) tends to with m. 

The investigation of the case c close to 1, initiated by Bollobas ID and further studied by Luczak 
132] , revealed an intricate picture of the phase transition's nature. See \7\ for results up to 1984, and 
IU|27l|28l|33] for references to subsequent work. We briefly describe these now. 

The subcritical phase. Let e n = o(l) be a non-negative sequence with e n » n~ 113 and put p = (1 - 
e n )ln, then, for any fixed integer j > 1, 

" 1 > 

2e~ 2 log(£:^) 

where — ► denotes convergence in probability. 

The critical window. When p = (1 + 0(n~ 1/3 )) / for any fixed integer j > 1, 

V n 2/3""' n 2/3j 

where (Xi)Li are random variables supported on (0,oo), and — denotes convergence in distribution. 

The supercritical phase. Let e n = o(l) be a non-negative sequence with e n » «" 1/3 and put p = 
(l + e n )/n, then 

l^i I p 



1, 

while, for any fixed integer j > 2, 



1. 



2e n l \og{£in) 



1.2. The phase transition on the hypercube. One inherent difficulty for percolation on the hyper- 
cube, or on any finite graph, is that it is not obvious how to define its critical value. In the Erdos and 
Renyi random graph this critical probability should turn out to be (1 + 0(n~ li3 ))/n in whichever defi- 
nition we use! When p = (1 + 0(n~ 113 ))/ n in G{n, p) we have that the mean cluster size is of order n 113 . 
This inspired Borgs, Chayes, the first author, Slade and Spencer (SOLOim t0 su gg est that the precise 
location p c = p c {X) of the phase transition is the unique solution to the equation 

E Pc |^(0)| = A2 m/3 . (1.1) 

where ^(0) is the connected component containing the origin, |^(0)| denotes its size, and A £ (0, 1) 
denotes an arbitrary constant that is typically taken to be small. Here 2 m/3 can be viewed as the cube 
root of the volume of the graph, i.e., its number of vertices. There are several other more intuitive 
definitions (see the discussion in Section 7 of (34 1), however, in order to justify these definitions one 
needs to show that analogous results to the ones described in Section [TTT1 holds with this definition. 
To the best of our knowledge this was only done with (T7Q. Let us now describe the phase transition 
of percolation on the hypercube around this p c . 

From here on, we take p c = p c {X) with A e (0,1) a fixed constant. The phase transition on the 
hypercube is described in the following three theorems, in all of which we consider bond percolation 
on the hypercube {0, \} m with varying p. 
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Theorem 1.1 (The subcritical phase EldO)). Put p = p c {\ - e m ) where e m = o(l) is a positive sequence 
satisfyinge m » 2~ ml3 . Then, for all fixed 6 > 0, 

P p (e m 2 /3600< |%| < (2 + 5)£ m 2 log(42 ra )) = l-o(l), (1.2) 

and 

l + o(l) 

E p 1^(0)1= ■ , ■ (1.3) 

\£m\ 

Theorem 1.2 (The critical window EJCEO)). Put p = p c {\ + e m ) with \e m \ = 0{2~ ml3 ). Then, 

P p (w _1 2 2m/3 < |^i| < (x)2 2ml3 ) > 1 - 0{oj~ 1 ) . (1.4) 

and 

E p |^(0)| = 0(2 m/3 ). (1.5) 

Theorem 1.3 (The supercritical phase |24|). Put p = p c {l + e m ) where e m = o(l) is a positive sequence 
withe m »2~ ml3 . Then 

l^i I 



1, (1-6) 



V 

where — ► denotes convergence in probability, and 



E p |^(0)| = (4 + o(l))42 m . (1.7) 



Furthermore, the second largest component^ satisfies 

\ c &l\ V 



0. (1.8) 



Theorems 11.11 and 11.21 are proved in [lOl . The work in |9l [10] did not provide sharp estimates 
for the supercritical phase and the authors conjectured (see Conjecture 3.2 in (HI) the statement of 
Theorem ll.31 proved in |24) . Thus, Theorems l 1 . 1141 .3l fully identify the phase transition and the critical 
window in the hypercube. 

1.3. The role of the percolation triangle condition. Let us briefly review the study of random sub- 
graphs of general finite transitive graphs initiated in ©[[0) . Let G be a finite transitive graph and write 
V for the number of vertices of G and m for its degree. Let p e [0, 1] and write G p for the random 
graph obtained from G by retaining each edge with probability p and erasing it with probability 1 - p, 
independently for all edges. We also write P p for this probability measure. We say an edge is p-open 
(p-closed) if was retained (erased). We say that a path in the graph is p-open if all of its edges are p- 
open. For two vertices x, y we write x «->■ y for the event that there exists a p-open path connecting x 
and y. For an integer j > 1 we write for the y'-th largest component of G p (breaking ties arbitrarily) 
and for a vertex v we write °€{v) for the component in G p containing v. 
For two vertices x, y we denote 

V P (x,y) =£p p (x~ u)P p {u~ v)P p (v~ y). (1.9) 

u,v 

The quantity V p (x, y) , known as the triangle diagram, was introduced by Aizenman and Newman 1 1 1 
to study critical percolation on high-dimensional infinite lattices. In that setting, the important fea- 
ture of an infinite graph G is whether V Pc (0, 0) < oo. This condition is often referred to as the triangle 
condition. In high-dimensions, Hara and Slade |21 1 proved that the triangle condition holds. It allows 
to deduce that numerous critical exponents attain the same values as they do on an infinite regular 
tree, see e.g. (H m HH [30] . 

When G is a finite graph, V p (0, 0) is obviously finite, however, there is still a. finite triangle condition 
which in turn guarantees that random critical subgraphs of G have the same geometry as random 
subgraphs of the complete graph on V vertices, where V denotes the number of vertices in G. That is, 
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in the finite setting the role of the infinite regular tree is played by the complete graph. Let us make 
this heuristic formal. 

We always have that V — ► oo and that A e (0, 1) is a fixed and small constant. Let p c = p c (A) be 
defined by 

E Pc(A) |^(0)| = AV 1/3 . (1.10) 

The finite triangle condition is the assumption that V ' Pc {\) {x, y) < \{ X =y] + a®, for some a = a (A) suf- 
ficiently small. The strong triangle condition, defined in 1 10, (1.26)], is the statement that there exists 
a constant C such that for all p< p c , 

V p (x,y)<l {x=y} + Y +o(l), (1.11) 

where o(l) tends to as m — ► oo. In fTOl . 11.1111 is shown to hold for various graphs: the complete 
graph, the hypercube and high-dimensional tori Z^. Its proof relies on the lace expansion, a pertur- 
bative technique to investigate the two-point function V Pc (0 < — ► x) that was first used for percolation 
on the high- dimensional infinite lattice by Hara and Slade 121). The lace expansion is an extremely 
powerful technique, but is also quite involved. We feel that it should only be used when more ele- 
mentary techniques fail. Apart from surveying the literature on hypercube percolation, our aim in 
this paper is to show that Theorems 1 1 . 1 H 1 .31 can be proved without relying on the lace expansion. 

The main result of (9) is that the triangle condition implies strong estimates on |%| in the critical 
and subcritical case: 

Theorem 1.4 (1 9 1). Consider bond percolation on a finite transitive graph G having V vertices and 
degree m satisfying the strong triangle condition ll.llt where m — ► oo as V — ► oo. Then the assertions of 
Theorems ] 1 . U and lL^ hold when each occurrence of2 m is replaced with V. 

The triangle condition is significant since it arises naturally in various calculation one performs. To 
indicate how this occurs, we provide here a classical argument of Barsky and Aizenman |4) control- 
ling the size of E p |^(0)| as p varies. We put %{p) = E p [|^(0)|] and will show that the strong triangle 
condition 11.111 implies that for all p<p c , 

l + O(max^ Vp e (0, v)) 

x(p) = — ; * r W • (L12) 

m{p c - p) + %{p c ) 1 

In particular, 11.1211 implies that x(p) = Q(^ 1/3 ) whenever p < p c is in the scaling window, i.e., when 
m{p c - p) = 0(V~ 1/3 ) as stated in 11.51 in Theorem ll.21 Equation 11.121 1 also proves 11.31) in Theorem 

o 

We start by proving the upper bound in 11.121 . We remark that this bound is valid for all transitive 
graphs (that is, we do not require here the triangle condition). By Russo's formula, 

-^-P p (0< — ► x) = £ V p {{u,v) is pivotal for 0< — ► x), (1.13) 

dP (u,v)eE{G) 

where we say that a (directed) bond {u, v) is pivotal for < — ► x when (a) < — ► u and (b) < — ► x in 
the (possibly modified) configuration where the status of [u, v) is turned to occupied, while is not 
connected to x in the (possibly modified) configuration where the status of {u, v) is turned to vacant. 
Summing over x yields 

■J-XW= E E P P (( "-^) is pivotal for (1.14) 

d P x£V{G)(u,v)£E[G) 
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If [u, v) is pivotal for - — ► x, then there exist two disjoint paths of occupied bonds connecting 
and u, and v and x, respectively. Thus, {0 « — ► u) o {v < — ► x) occurs. The BK inequality |20| gives 

—%{p)< X E P P (0— u)V p {v^ x) = m X (pf. (1.15) 

d P xeV(G)[u,v)eE(G) 

We rewrite the last inequality as ^l(p) -1 > -m, and integrate over [p, p c ] to get 

X(Pc)" 1 -l(p)" 1 > -m{p c -p), (1.16) 

so that 

*W - -? VI < w ' (L17) 

m(p C - p)+Xtyc) 1 

showing the upper bound in 11.121 . For the lower bound we write 

V p ((u,v) is pivotal for < — ► x) = E p [i {0 ^ u} T ciMm (v,x)], (1.18) 

where S^"'^ (0) consists of those sites which are connected to without using the bond [u, v) and for 
a set of sites A, the restricted two-point function t a {v,x) is the probability that v is connected to x 
and every open path from v to x has all its edges not touching A. Clearly ^[v, x) <r{v,x), and 
this is in fact an easy way to prove BK inequality for this particular events. We note that 

P p (y — x)-t a (v,x) = P p (v~x), (1.19) 

where we write that v < — ► x when every open path from v to x has an edge touching A. Thus, 

4-XW= E E E p []l {0 _ M} ]Pp(v~ x) 

a P xeV(GH",f)e£(G) 



?(»>") (0) 



Now, for any AqZ c 



- £ £ E p [fl {0 _ M} P p (^^ UJ x)] 

xey(G)(«,y)e£(G) 

= m X {pf- X E Ep[]l !0 ^ u) P p (^ -4 x)]. (1.20) 

xeV(G)(«,i>)eE(G) 



P p (i/ ^ x) < Y,P p {{v — a} o {a — x})ll {fleA} , (1.21) 



which by BK inequality and summing over x leads to 

-^-X(p)>m^(p) 2 -^(p) ^ £P p (0— u,ae& u>v) {0))P p {v~ a) (1.22) 

>mxlpf-x(p) E E p p^ ^ m,0— a)P p (i/ — a). 

(w,y)e£(G) « 

If - — ► u and < — ► a, then there exists z such that {0 < — ► z} o {z « — ► u} o {z - — >• z}, so by the BK 
inequality 

■£-X(P)^™X(P) 2 -XW E £p p (0«— z)P p (z«— - u)Pp(z — a)Pp(^ — a). 

{u,v)eE(G) a,z 

The sum over a, z looks almost like the triangle diagram, except for the the annoying P p (0 < — ► z) 
factor. However, by transitivity the double sum on the right hand side remains the same if we replace 
by any other vertex. Hence we may sum this over 0, getting a factor of %{p), and add a factor of V~ l . 
This gives that 

-^-%{p) > mxipf-mxipf V Vp(0, v) < mj(p)maxVp(0, v), 
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f p (JC, y) + p (JC, V) ,1 

r mix = min It: max p — ^— ^- < (1 + odW 1 [, (1.24) 



implying that 

— *(p) > mx(p) z [l -maxV Pc (0, v)]. (1.23) 
We now integrate as we do in 111.161 1 and obtain the lower bound in 11.121 . 

1 .4. Random walk conditions for percolation. We now describe a general theorem, obtained in (24) , 
which allows to deduce as corollaries Theorems I1.1H1.3I under certain geometric conditions on the 
underlying graphs. These conditions are more restrictive than the triangle condition, (for instance, 
they do not hold in the case of high-dimensional tori, but do hold for the hypercube) but are easier 
to verify since they are expressed in terms of random walks. In particular, these conditions imply 
the strong triangle condition (and hence by Theorem ll.4l they imply Theorems ll.ll and ll.2l l, but more 
importantly they allow us to analyze percolation in the supercritical case, where the triangle condition 
ceases to hold, and obtain Theorem ll.31 

Let G be a finite transitive graph on V vertices and degree m. Consider the non-backtracking ran- 
dom walk (NBW) on it (this is just a simple random walk not allowed to traverse back on the edge it 
just came from). For any two vertices x, y, we put p l {x, y) for the probability that the walk started at x 
visits y at time t. We write r m jx for the uniform mixing time of the walk, that is, 

p f (x, y)+p f+1 (x,y) 

x,y 2 

where o(l) tends to slowly. Then the main result in (2U is as follows: 

Theorem 1.5 (1 24]). Let G be a transitive graph on V vertices with degree m and define p c as in [LID 
with A = 1 / 10. Assume that the following conditions hold: 

(1) m^oo, 

(2) [p c {m-l)] T ™* = 1 + 0(1), 

(3) For any vertices x, y, 

T ■ 

£ £ p h (x,u)p t2 (u,v)p t Hv,y) = oanogV). (1.25) 

U,V ti,t 2 ,t 3 =0 
fl + t 2 + f3£3 

Then, 

(a) the finite triangle condition [1.1 ID holds (and hence the assertions of Theorems ] 1 . hold), 

(b) for any sequence e = e m satisfyinge m » V~ 113 ande m = o(TZ^), 

-^-^1, E p |^(0)| = (4 + o(l))4v, ^L^o. (1.26) 

2e m V e m V 

In Section |2] we will part (a) of the Theorem above, and in Section [3] we will verify the conditions 
of the Theorem. Hence, we will obtain a proof that the triangle condition holds on the hypercube. 
Note that condition (2) involves both a random walk estimate (bounding r mix ) and a percolation es- 
timate (bounding p c ). Let us now discuss how the verification of these conditions is done in a rather 
elementary way. 

1.5. Back to the hypercube. It is a classical fact that the total-variation mixing time of the random 
walk on the hypercube is of order ralogra (32. A separate argument is needed to show that this is 
the correct order for r m bc since (a) we are dealing with the non-backtracking walk; and (b) we require 
a bound on the stronger uniform mixing time. This can be done by analyzing the transition matrix 
of the non-backtracking random walk using classical tools. This analysis is performed by the Fitzner 
and the first author in 1 18 1 and also allows us to verify condition (3) of Theorem ll.51 We will not delve 
further into this part of the proof. 
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Thus, given that T^ = mlogm, the verification of condition (2) in Theorem 1 1.51 in the case of the 
hypercube simply states that p c = + o{m~ 2 log m). This estimate (and more) was already proved 
in the work of the first author and Slade |25, 26] so no further estimates on p c were required in (24). 
However, the estimate we require is much weaker and the proofs in (25j [26] are difficult and rely on 
the lace expansion. In this paper we provide an elementary argument giving this estimate. This is 
the last piece in the "unlacing" puzzle which verifies condition (2) of Theorem 1 1.51 in the case of the 
hypercube. 

Theorem 1.6 (Unlacing the lace expansion in the hypercube). Consider bond percolation on the hy- 
percube {0, l} m . Then, there exists C > such that 

l + 5/(2m 2 ) + C/m -3 

Pc< (1.27) 

m-1 

Consequently, [{m - l)p c ] Tmix = 1 + o(l), so that the results in Theorems ] 1.4\ and \1.5\ apply (and hence 
also the assertions of Theorems ] 1 . ifTOl) . 

This theorem is the only novel result of this paper, and is proved in Section |3j Let us further briefly 
discuss the precise value of p c since it is related to the early literature on hypercube percolation. 

The asymptotic expansion of p c . The problem of establishing a phase transition for the appearance 
of a component of size of order 2 m was solved in the breakthrough work of Ajtai, Komlos and Sze- 
meredi 1 2 1 . They proved that when the retention probability of an edge is scaled as p = cl m for a fixed 
constant c> the model exhibits a phase transition: for c < 1, the largest component has size of order 
m whp, while for c> 1, the largest component has size linear in 2 m whp. See also [1711551 for proofs 
that the giant component has size ((c)2 m (1 + o(l)) whp when p - elm, where ((c) is the survival prob- 
ability of a Poisson branching process with expected offspring equal to c. Thus, p c ~\lm, but it was 
unclear at that time just how close it is. 

The first improvement to |2| was obtained by Bollobas, Kohayakawa and Luczak |8|. They showed 
that if p = (1 + e m )lm vAxhe m = o(l) but e m > 60m" 1 (log m) 3 , then l^l = (2 + o(l))e m 2 m whp. This 
raises the question whether p c = 1 / (m - 1), which is answered negatively in |25l|26l . These results give 
the most precise estimates on p c to date: 

Theorem 1.7 (Asymptotic expansion of p c (25j[26)). For bond percolation on the hypercube {0, \} m , 
there exist rational coefficients (flj) j>i with a\ = az = 1, a^ = 7/2 such that, for every s > 1, as m — • oo, 



s 



p c = J^a i m- i + 0(m- [s+1) ). (1.28) 

i=l 

Note that by Theorem ll.lUl.3l and ll.71 whatever s > 2 is, the largest cluster jumps from 0(m 2s_1 ) 
for p = £f =1 aim~ l + T]m~ s with r\ < to ®{2 m I m s ~ x ) for p = £f =1 aim' 1 + r]m~ s with 77 > 0. Thus, the 
phase transition in 77 is extremely sharp for every s>2 fixed. 

1.6. Open problems. We here collect a list of what we consider to be important open problems in 
this area. Some of these problems appear in |24, Section 8], but not all. 

1. Percolation on high-dimensional tori. Consider bond percolation on the nearest-neighbor 
torus where d is a large fixed constant and n — >■ 00 with p = p c (l + e n ) such that e n » n~ d ' 3 
and e n = o(l). Show that \ c €\\l(e n n d ) converges to a constant. Does this constant equal the 
limit as £ I of e~ 1 6 Z d (p c (l + £))? Here 6 Z d [p) denotes the probability that the cluster of the 
origin is infinite at p-bond percolation on the infinite lattice Jr. 

2. Identify the scaling limit of cluster sizes in the scaling window. Show that (| < ^/|2 -2m/3 )y>i 
converges in distribution when p = p c [l + t2~ ml3 ) and t £ U is fixed and identify the limit 
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distribution. This should be the limiting distribution of critical clusters in G{n, p) as identified 
byAldous |3). 

We remark that in |23) it is proved that any subsequential limit of {\ c &i\2~ 2ml3 } m >i is a proper 
random variable, that is, when P(X = E[X]) < 1. This non-concentration is the hallmark of 
critical behavior. 

3. Prove that the discrete duality principle holds for hypercube percolation. Show that |^i | = 
(2+o(l))£ -2 log(£ 3 2 m ) when e m « -2~ ml3 and e m = o(l) and that |^ 2 I = (2+o(l))£ -2 log(£^2 m ) 
when p = p c {\ + e m ) with e m » 2~ m ' 3 . This is also the content of [LT] Conjectures 3.1 and 3.3] 
and is proved for some values of e m in (8). In G{n, p) these results are proved in |36] and 1 28 , 
Theorem 5.6]. 

4. Prove a central limit theorem for | . Showthat |^i| satisfies a central limit theorem through- 
out the supercritical regime. In G{n, p) this and much more was established by Pittel and 
WormaldEU. 

5. Unlace the asymptotic expansion of p c . Find a proof of Theorem l 1 .7l that does not rely on the 
lace expansion. Possibly, the ideas in the proof of Theorem ll.6l in Section[3]can be used. 

6. Compute further coefficients of the asymptotic expansion of p c . Find the numerical values 
of ai for i > 4 in Theorem 1 1.71 There is a large physics literature on asymptotic expansions 
of critical values. See e.g., (25] for some of the references. We expect that a 4 = 16, as the first 
4 coefficients of the asymptotic expansion of p c can be expected to be the same as the ones 
for the asymptotic expansion of p c (Z d ) in terms of inverse powers of 2d (see e.g. [[9]). We 
also expect that a% is not equal to the 5th coefficient in the asymptotic expansion of p c (7- d ) in 
terms of 1 / {2d), which is predicted to be equal to 103 1 19] . Recently, substantial progress was 
made for the asymptotic expansion of the connective constant for self-avoiding walk on Z d , 
for which the first 13 coefficients have been computed by Clisby, Liang and Slade (see l T3l[T4l ). 

1.7. Acknowledgements. The work of RvdH was supported in part by the Netherlands Organization 
for Scientific Research (NWO). The work of AN was partially supported by NSF and NSERC grants. 
This work was presented by RvdH on the occasion of the Stochastik Tage 2012, held in Mainz March 
6-9,2012. 

2. Overview of the proof of the supercritical phase 

In this section we give an overview of the key steps in the proofs in |24) . From here on, we assume 
that e m is a sequence such that e m = o(l) but e 3 m V —>■ oo. 

2.1. Notations and tools. We write dc p {x,y) for the length of a shortest p-open path between x,y 
and put dc p {x,y) - oo if x is not connected to y in G p . We write x *-> y if do p {x,y) < r and x < = - y 

if do (x,y) = r and x y if dc p {x, y) e [a,b]. The intrinsic metric ball of radius r around x and its 
boundary are defined by 

B x (r) = {y: d Gp (x,y)<r}, 5B x {r) = {y: d Gp {x,y) = r} . (2.1) 

Note that these are random sets of the graph and not the balls in shortest path metric of the graph G. 
We often drop from notation and write B[r) for B [r) whenever possible. 

2.2. Tails of the supercritical cluster size. We start by describing the tail of the cluster size in the 
supercritical regime. 

Theorem 2.1 (Bounds on the cluster tail). Let G be a finite transitive graph of degree m onV vertices 
such that the finite triangle condition fiJil) holds and put p = p c {\ + e m ) where e m = o(l) ande m » 
V~ 113 . Then, for the sequence ko = e~^{e 3 m V) lli , 

P(IW)|2s*b)=2e m (l + o(l)). (2.2) 
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This Theorem is reminiscent of the fact that a branching process with Poisson progeny distribution 
of mean 1 + e has survival probability of 2e(l + 0(e)). Upper and lower bounds of order e for the 
cluster tail were proved already in 1 10] using Barsky and Aizenman's differential inequalities (U, and 
were sharpened in EU Appendix A] to obtain the right constant 2. 

Let Z> fc denote the number of vertices with cluster size at least k, i.e., 

Z, k =\{v:\^(v)\>k}\. (2.3) 
We use Theorem l2.1l to show that Z> kQ , with k as in the theorem, is concentrated. 
Lemma 2.2 (Concentration of Z> kQ ). In setting of Theorem l2J[ ifm ->• oo, then 

— ^^1, and E|^(0)| < (4 + o(l))e 2 V\ (2.4) 
2eV 

Lemma lZ2l immediately proves the upper bound on Y€\ | in Theorem ll.3l 

Proof of upper bound on \ c € x \ in Theorem[DC Note that {|^i| > k\ = {Z> k > 1}, so that |^i| < Z> k on 
the event {Z> fc > 1}. Applying this to k = ko and using Lemma l2~2l proves the upper bound in Theorem 
O □ 

2.3. Uniform connection bounds and the role of the random walk. We expand here on one of our 
most useful estimates on percolation connection probabilities. In its proof, a simple key connec- 
tion between percolation and the mixing time of the non-backtracking walk is revealed. In the anal- 
ysis of the Erdos-Renyi random graph G(n, p) symmetry plays a special role. One instance of this 
symmetry is that the function /(x) = P(0 <->■ x) is constant whenever x ^ and its value is precisely 
(V - 1)" 1 (E| < ^(0)| - 1) and 1 when x = 0. Such a statement clearly does not hold on the hypercube at 
p c : the probability that two neighbors are connected is at least p c > m" 1 , while the probability that is 
connected to one of the vertices in the barycenter of the cube is at most \/m2~ m E\ c ^{0) \ by symmetry. 

A key observation in the proof of Theorem ll.3l in |24| is that one can recover this symmetry as long 
as we require the connecting paths to be longer than the mixing time of the random walk, as shown 
in IH Lemma 3.12]: 



Lemma 2.3 (Uniform connection estimates). Perform bond percolation on any graph G satisfying the 
assumptions ofTheorem \1.5[ Then, for every r > r m ix and any vertex xeG 

„ m [r mix ,r] E|5(r)| 

P Pc (0< ►x)<(l + o(l)) — - — , (2.5) 

where r m jx is uniform mixing time as defined above Theorem ] 1.3 In particular, 

PpA0 ^ x)sa+om) ^m. (2. 6) 

The proof of the above lemma is short and elementary, see (24). There it is also shown how to ob- 
tain similar estimates for p = p c {\ + e) (with an error depending on e). The uniformity of this lemma 
allows us to decouple the sum in the triangle diagram and yields a simple proof of the strong triangle 
condition, as we now show. 

Proof of part (a) of Theorem ll.51 Let p < p c . If one of the connections in the sum V p (x, y) is of length 
in [r m jx,oo), say between x and u, then we may estimate 

^ IT ■ oo) (l + O(l))E„|^(0)| ^ 

£p p (x UieU u )P p (u - v)P p (v - y) < — J^P p (u - v)P p (v - y) (2.7) 



(l + o(l))(Ep|^( 0)|) 
V 



u,v 
3 
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where we have used Lemma 1231 for the first inequality. Thus, we are only left to deal with short con- 
nections: 

V p (x,y) < £P p (x ~ u)V p {u £=5E v) p piu h^y) + 0(xipf/V) . (2.8) 

u,v 

We write 

T - ^rni x _ f 

P p (x^u)= ^Pp(x-u), (2.9) 

fl=0 

and do the same for all three terms so that 

^mix _ t _ f _ f 

V p (x,y)<£ £ P p (x^i u)P p (u^v)P p (v^y) + 0( X (p) 3 /V). (2.10) 

U, V t\ , t2 , t3 

We bound 

P p (x~ u) < m(m-l) h ~ 1 p h (x,u)p tl , (2.11) 

simply because m{m - 1) fl_1 p f i [x, u) is an upper bound on the number of simple paths of length t\ 
starting at x and ending at u. Hence 



fjj^ ^mix 

V p (x,y)< _ n3 E £ mm-l)] h+t2+t *p t Hx,u)p t2 {u,v)v t3 {v,y) + 0(.x(pf/V). (2.12) 

V m J- J U,Vti,t 2 ,t3 



Since p < p c , assumption (2) gives that [p(m - 1)] f i +f2+f3 = 1 + o(l), and it is a simple consequence of 
condition (3) that 

^mi x 

X £ [p(m-l)] fl+t2+f3 p' 1 (x ) u)p f2 (u,i;)p f3 (^y)<l { x=y} + o(l), (2.13) 

U,V ty,t 2} % 

where o(l) vanishes asm^oo, concluding the proof. □ 

2.4. Most large cluster share large boundary. Since this is the most technical part of the overview, at 
the expense of being precise, we have chosen to reduce the clutter of notation and suppress several 
parameters from the notation. We ignore several dependencies between parameters and the skeptical 
reader is welcomed to read the more precise overview presented in (24) . 

Two parameters however play an important role. We choose r and r so that r » but just barely, 
and ro » r in a way that will become clear later. For vertices x, y, define the random variable 

Sr+ro (x, y) = | {(u, u') e E[G) : {x u} o {y ^2 u'} ,\B u (r + r )\- \B U , (r + r ) I < e~ 2 (E|5(r ) |) 2 } | . (2.14) 

The edges counted in S r+ro (x, y) are the ones that we are going to sprinkle. Informally, a pair of vertices 
(x,y) is good when their clusters are large and S r+ro (x,y) is large, so that their clusters have many 
edges between them. We make this quantitative in the following definition: 

Deflnition2.1 ((r, r )-good pairs). Wesaythatx,y are {r, r ) -good if all of the following occur: 

(1) dB x (r) 7^ 0, dB y (r) ji andB x {r) n B y (r) = 0, 

(2) |^(x)| > (e 3 m V) ll4 e-J and\^{y)\ > (e 3 m V) ll4 e-J, 

(3) S 2 ,+,- (x,y) > y- 1 m£ m 2 (E|5(r )|) 2 . 

Write P rjQ for the number o/(r, ro) -good pairs. 

Theorem 2.4 (Most large clusters share many boundary edges). Let G be a graph on V vertices and 
degree m satisfying the assumptions in Theorem UZBl Assume that e m satisfies e m » V~ 113 and e m = 

°( r miP- Then > 



(2e m V) 2 



1. 
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In light ofTheorem l2.11 we expect that the number of pairs of vertices {x, y) with |^(x) | > {e 3 m V) ll4 e~^ 
and \ c ^{y)\ > (e 3 m V) ll4 e~^ is close to (2e m V) 2 . Theorem [O] shows that almost all of these pairs have 
clusters that share many edges between them. Theorem 12.41 allows us to prove Theorem II .51 as we 
describe in more detail in the next section. 

The difficulty in Theorem l2.4l is the requirement (3) in Definition [270 Indeed, conditioned on sur- 
vival (that is, on dB x {r) ^ 0, dB y {r) ^ and that the balls are disjoint), the random variable S r+ro {x, y) 
is not concentrated and hence it is hard to prove that it is large. In fact, even the variable |B(ro) | is not 
concentrated. This is not a surprising fact: the number of descendants at generation n of a branching 
process with mean /i > 1 divided by p" converges as n — ► oo to a non-trivial random variable. Non- 
concentration occurs because the first generations of the process have a strong and lasting effect on 
the future of the population. In (24), we counteract this non-concentration by conditioning on the 
whole structure of B x (r) and B y {r). Since r is bigger than the correlation length (r » e~^), under this 
conditioning the variable S, ■+,-„ {x, y) is concentrated (as one would expect from the branching process 
analogy). 

2.5. Sprinkling and improved sprinkling. The sprinkling technique was invented by Ajtai, Komlos 
and Szemeredi (2) to show that |%| = 0(2 m ) when p = (1 + e)lm for fixed e > and can be described 
as follows. Fix some small 9 > and write p\ = (1 + (1 - 6)e)lm and p 2 > Oelm such that (1 - p\){\ - 
P2) = 1 - p. It is clear that G p is distributed as the union of the edges in two independent copies of 
G pi and G P2 . The sprinkling method consists of two steps. The first step is performed in G pi and 
uses a branching process comparison argument together with an Azuma-Hoeffding concentration 
inequality to obtain that whp at least c 2 2 m vertices are contained in connected components of size 
at least 2 Cim for some small but fixed constants c\, c 2 > 0. In the second step we add the edges of G P2 
(these are the "sprinkled" edges) and show that they connect many of the clusters of size at least 2 Cim 
into a giant cluster of size ©(2 m ). 

Let us give some details on how the last step is done. A key tool here is the isoperimetric inequality 
for the hypercube stating that two disjoint subsets of the hypercube of size at least c 2 2 m /3 have at 
least 2 m lm im disjoint paths of length C{c 2 )\/rn connecting them, for some constant Cfe) > 0. (The 
m 100 in the denominator is not sharp, but this is immaterial as long as it is a polynomial in m.) This 
fact is used in the following way. Write V for the set of vertices which are contained in a component of 
size at least 2 Cim in G pi so that V > c 2 2 m . We say that sprinkling fails when l^i | < C22 m /3 in the union 
G Pl u G P2 . If sprinkling fails, then we can partition V' = A i+j B such that both A and B have cardinality 
at least c 2 2 m /3 and any path of length at most C(c 2 ) \frh between them has an edge which is P2 -closed. 
The number of such partitions is at most 2 2 12 1 . The probability that a path of length k has a in- 
closed edge is 1 - p\ . Applying the isoperimetric inequality and using that the paths guaranteed to 
exist by it are disjoint so that the edges in them are independent, the probability that sprinkling fails 
is at most 

2 2m /2 c im _ r _ ,6e L ,c(c 2 )^m\ 2mimW0 = e -2 (1+o(1))m ( 2 15 j 

v m > 

which tends to 0. 

The sprinkling argument above is not optimal due to the use of the isoperimetric inequality. It 
is wasteful because it assumes that large percolation clusters can be "worst-case" sets, that is, sets 
which saturate the isoperimetric inequality (e.g., two balls of radius ra/2 - \pm around two vertices at 
Hamming distance m). However, it is in fact very improbable for percolation clusters to be similar to 
this kind of worst- case sets. In 1 24 1 , this is replaced by an argument showing that percolation clusters 
are "close" to uniform random sets of similar size, so that two large clusters share many closed edges 
with the property that if we open even one of them, then the two clusters connect. 



12 



REMCO VAN DER HOFSTAD AND ASAF NACHMIAS 



Improved sprinkling: Proof of Theorem 1 1.51 b). Recall that we already proved the upper bound on 
|^i | below Lemma l272"l so it remains to show that 

P p {m\ > (2 - o(l))e m V) = 1 - o(l) . (2.16) 

Recall that p = p c {\ + e m ) is our percolation probability and choose pi,p2 satisfying 

p 2 = 6e m /m, p c (l + E m ) = pi + 0--pi)p 2 , (2.17) 

where 9 > tends to extremely slowly so that p\ = [1 + (1 - o{\))e m )]p c . Denote by G Pl and G P2 as 
before. We first invoke Theorem l2.4l in G Pl and deduce that whp 

P rj0 = {\-o{l))Ae 2 m V 2 . (2.18) 

Now we wish to show that when we "sprinkle" this configuration in G pi , that is, when we add to the 
configuration independent p 2 -open edges, most of these vertices join together to form one cluster of 
size roughly 2e m V. We construct an auxiliary simple graph H with vertex set 

V(H) = {xe G pi : \<£{x)\ > {s 3 m V) m e^} , 

and edge set 

E{H) = {(jt,y) e V(Hf: x,y are (r, r )-good} . 
Lemma l2~2l and H2.18I I now imply that whp H is almost the complete graph, that is 

\V{H)\ = (2 + o{\))e m V, \E(H)\ = (l-o{l))4e 2 m V 2 . (2.19) 

Denote v = \ V[H) \ so that v = {2+o(l))e m V and write Xi,...,*,, for the vertices in G Pl corresponding 
to those of H. Given G Pl for which the event in (12. 191 1 occurs, we will show that whp in G Pl u G P2 there 
is no way to partition the set of vertices into M\ i+j M 2 = {x\,...,x v } with \M\\ > Q,{e m V) and |M 2 | > 
VL{e m V) such that there is no open path in G pi u G P2 connecting a vertex in M\ with a vertex in M 2 . 
This implies that whp the largest connected component in G pi u G P2 is of size at least (2 - o{\))e m V. 

To show this, we first note that the number of such partitions is at most Z 3 ^" 1 ^ 3 4 since |^(Xj)| > 
(4^) 1/4 £ m 2 . Secondly, given such a partition consisting of Mi and M 2 , we claim that the number 
of edges {u, u') e E{H) such that ue Mi and u' £ M 2 (note that, by definition, these edges must be 
Pi -closed) is at least 0.{e 2 m Vm). To see this, we consider the set of edges in H for which both sides lie 
in either Mi or M 2 (more precisely, the vertices of H corresponding to Mi and M 2 ). This number is 
clearly at most 

Mf + Mf < (4-n(l))e 2 V. 

Hence, by J2.19I I, the number of edges in H such that one end is in Mi and the other in M 2 is at 
least VL{e 2 m V 2 ). In other words, there are at least ce 2 m V 2 pairs (x,y) e Mi x M 2 such that S r+ro [x,y) > 
cV~ l me~^ (E|B(r )|) 2 . We choose r so that this is a large number. In total, we counted at least order 
e 2 V 2 - V _1 me^ 2 (E|B(ro)|) 2 edges [u, u') and no edge is counted more than |B M (r + r )|-|-B M '(r+r )| times, 
which is at most order e^{E\B(ro)\) 2 by the definition of S r+ro [x, y) and the second claim follows. 

Hence if l^i | < (2 - H(l))e V after the sprinkling, then there exists such a partition in which all of the 
above edges {u, u') are p 2 -closed. By the two claims above, the probability of this is at most 

2 3(4^ 3/4 (1 _ p2) c4^m = 0(1); 

since p 2 = Qe m lm and 9 goes to very slowly. This establishes the required estimate on l^il. 

We now use 112.161 1 to show the required bounds on E|^(0)| and |^ 2 |. The upper bound £1^(0)1 < 
(4 + o[l))e 2 m V is stated in Lemma [2721 and the lower bound follows immediately from our estimate on 
|^i |, since 

E\ C €(Q)\ = V- 1 £ E\^{v)\ = V- 1 J^E\^ j \ 2 >V- l E\^ 1 \ 2 >{4-oa))e 2 m V, 

veV(G) j>l 
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where the first equality is by transitivity, the second equality is because each component is counted 
\°€j | times in the sum on the left and the last inequality is due to 112.161 ) . Furthermore, by this inequality 
and Lemma l2~2l we deduce that 

£Ei^-i 2 = o(4y 2 ), 

and hence 1^1 = o[e m V) whp. This concludes the proof of Theorem ll.51 □ 

3. Unlacing hypercube percolation: Proof of Theorem 1 1.61 

The main result in this section is the following proposition: 

Proposition 3.1 (Expectation of intrinsic balls). Consider bond percolation on the hypercube {0, l} m 
with p = [1 + 5/ (2m 2 ) + B/m 3 ]/{m - 1) for some B > sufficiently large. Then, for m sufficiently large 
there exists ak>\ such that 

E|B(fc)| >2 m/2 /m 3 . (3.1) 
Consequently, p c < [1 + 5/(2m 2 ) + B/m 3 ]/(m- 1) = 1/ra + 1/m 2 + 7 1 {2m 2 ) + 0(l/m 4 ). 

The proof uses very elementary estimates on the non-backtracking random walk transition proba- 
bilities. For completeness we provide here the crude bounds that we will use, and remark that much 
more precise bounds are available in fT8l . 

Lemma 3.2 (NBW computations). Lete x = (1,0,..., 0) e {0,l} m ande XA = (1, 1,0,..., 0) e {0,l} m be hy- 
percube vectors. Then 

? 2 
P (0,e u ) 



m[m - 1) 

and for any fixed to>2 there exists C = C(fo) > such that for all t > to, 

p 2f (0,e u )<Cm" fo " 1 . 

Furthermore, 

p 3 (0,ei) 1 



m{m- 1) 

and for any fixed to>2 there exists C = C(fo) > such that for all t > to, 

p 2f+1 (0,ei) <Cm" trl . 

Proof. The equality involving p 2 (0, e^) is immediate since the probability that the non-backtracking 
walk takes any one of the two paths of length two from to e\ t \ is [m(m - 1)] _1 . For the second 
inequality, denote by X t e {0, l} m the location of non-backtracking random walk after t steps and 
by N t the number of l's in X t . First note that by symmetry we have 

P(X 2f = e u | X = 0, N 2t = 2) = — — - • 

m(m - 1) 

So let us estimate the probability that N 2t = 2. This event implies that N 2t -2t +k < 2? + 2 for any 
1 < k < 2t . Hence, on this event the process {Nzt-zto+k}^ is stochastically bounded below by a 
process {M^} 2 ^ that has independent increments taking the value 1 with probability 1 - 2t / m and 
-1 with probability 2 t Q /m and Mi satisfying < Mi < 2t . If N 2t = 2, then M 2fo < 2. We bound the 
probability of the latter event crudely: for it to occur there must have been at least to - 1 of the 2t 
increments that take the value - 1, hence 

P{N 2t = 2 | X = 0)<|^ i J(2fo/m) f °- 1 <Cm- fo+1 , 
where C = C(f ) > 0. This and the previous estimate concludes the proof of the bound on p 2f (0, ei,i). 
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The equality p 3 (0, e\) = [m(m- l)] -1 stems from the fact that there are precisely (m - 1) non- 
backtracking paths of length 3 from to e\, and the probability of taking each is [m{m - l) 2 ] -1 . The 
bound on p 2f+1 (0, e\) is performed almost identically to the bound on p 2f (0,ei,i), we omit the de- 
tails. □ 

We prove recursive bounds on E|dB(/c) | that form the key ingredient in the proof of Proposition l3.ll 
Before doing so, we recall some notation. For a subset of vertices A, we say that an event M occurs 
off A, intuitively, if it occurs in G p \ A. Formally, for a percolation configuration oj and a set of vertices 
A, we write oja for the configuration obtained from oj by turning all the edges touching A to closed. 
The event "Jl occurs off A" is defined to be {oj : oja g M\. We often drop p from the notation when it 
is clear what p is. This framework also allows us to address the case when A = A{oj) is a random set 
measurable with respect to G p , the most prominent example being A = B (r) for some r > 1. In this 
case, the event {M occurs off A{oj)} is defined to be 

{M occurs off A{oj)} = {oj : o)a{w) e M\ . (3.2) 

For this example, we shall rely on the fact that, for an arbitrary event M and A = B x {s) (see |24j (3.1)]), 

V{M off B x (s)) = ^P(5 X (5) = A)V{J{ off A) , (3.3) 

A 

For two events E, F, we letE o F denote the event that there exists a set of bonds B such that ojb e E 
and (]}& e F (here we abuse notation slightly and now use wg for the configuration obtained from oj 
by turning all the edges in B to closed). Then, the BKR-inequality states that 

Pp(£oF)<Pp(£)P p (F). (3.4) 
Lemma 3.3 (Recursive bounds on E\dB[k) |). For any c> there exists B > such that if 

l + 5/(2m 2 ) + B/ra 3 

P= ; 

m— 1 

then,form= m{B) sufficiently large and for any k> 1 satisfy ingW\B{k)\ <2 ml2 /m 3 , 

E\dB(k)\ > [l + c/m 3 ]E|dB(fc-l)|. (3.5) 

Proof. We prove the claim by induction on A:. Given c> we will choose B to be large at the end of the 
proof — this choice will not depend on k or m. Given B, we choose m so large that 

p = {l + 5/(2m 2 ) + B/m 3 )/(m-l) < (1 + 10/m 2 )/(ra - 1), 

so our upper bound on p is independent of B. 

We start by initializing the induction. We have that E|d£(l)| = mp, while E|d£(0)| = 1, so that indeed 
ED holds for k = 1 (for any B > and c> 0). Let k > 1 such that E\B(k)\ < 2 ml2 /m 3 and assume the 
induction hypothesis holds for any £ < k - 1. 

We will now estimate the conditional expectation of \dB(k)\ given B(k - 1). To be precise, when 
we condition on B{k - 1) we condition on all the open and closed edges touching a vertex of B{k - 2) 
(observe that since the graph is bipartite there cannot be two vertices of dB{k - 1) that are connected 
by an edge). This allows us to calculate B{k - 1) and note that edges from dB{k - 1) to dB{k) are not 
revealed. Given this information, for each vertex x e dB{k - 1) the number of edges that we have not 
revealed any information on is precisely m - Y. r .y~x ^{yeB(k-2)} ■ Hence, 

E[\dB{k)\\B{k-l)]>mp\dB{k-l)\ - p £ %eaB(fc-l),yeB(fe-2)} 

p2 y ii 

x,y: d{x,y)-2 



UNLACING THE LACE EXPANSION: A SURVEY TO HYPERCUBE PERCOLATION 



15 



where the last term comes from subtracting the vertices of \dB{k) | we counted more than once, which 
happens if they have more than one "ancestor" in dB{k- 1). 

We take expectations in both sides and bound the two subtracted sums. We split the first sum 
according to whether the edge (x, y) is open or not. If (x,y) is open, then we must have that y e dB{k- 
2) and that (x, y) is open off B[k-2) (in the sense of H3.2I Q. Otherwise, that is, if x e dB{k- 1), y £ B{k-2) 

=e 

and the edge (x,y) is closed, then there exists £ <k-3 and a vertex w such that {0 <— w}oP k _ l _ 1 [w,x)o 

k—£—2 

{w « ► y} occurs, where P n {w,x) denotes the event that there exists a path of occupied bonds of 

length n connecting w and x (this is a monotone event). Indeed, let j x and j y be two shortest paths 
connecting to x and y, respectively and take w to be the last intersection of these paths and (. to be 
index of w in the paths (this has to be the same number for both paths since they are shortest paths). 
Note that since the edge (x, y) is closed, w has to be at distance at most k - 3 from 0. Then the witness 
for the first event is B{£) (that is, all the open and closed edges touching B{£ - 1) and the two other 
witnesses are the parts of j x and j y starting at w and ending at x and y, respectively. Similarly for 
the second contribution, if x, y e dB[k - 1), then there must exists £ < k-2 and a vertex w such that 

=£ 

{0<- * w}oP k _ l _ 1 {w,x)oP k _ l _ 1 (w,y) occurs. 
We sum over £ and w and use the BK-Reimer inequality to obtain that 

E\dB(k)\ > mpE\dB(k-l)\-n-{l)-(U), (3.6) 

where 

{*) = p £ P(0— w,(x,w) isopenoffB(A;-2)), 

w,x:x~w 

and 

fc-3 

CD = p£ E p (°^ wmP k - £ - 1 {w,x)o{w k ~y}), 

f=0 w,x,y: x~y 

and 

2 k—2 

(II) = ^-£ X P[0^w)P{P k - e - 1 {w,x)oP k _ e _ 1 { W ,y)). 

^ (=0 W,x,y: d(x,y)=2 

We start by bounding from above the sum (*). By conditioning on B{k - 2) we may rewrite (*) as 
(*) = /?£ £ P(B(fc-2) = A) P((*. M>) is open off A\B{k-2) = A). 

w =k-2 x: x~w 

Note that the probability that (x, w) is open off A equals p for any x such that x ~ w and the edge 

=£ 

(x, w) is not in A. Since A is such that w and P(B(k - 2) = A) > 0, we learn that there are at most 
m - 1 such possible x's (instead of m, the total number of neighbors of w ). Hence 

(*) < (m-l)p 2 E|dB(A:-2)| < (m- l)p 2 E\dB(k- 1)|, (3.7) 

where in the last line we used the induction hypothesis. We proceed by bounding from above the two 
sums (I) and (II). We handle the sums separately according to whether k- £ - 1 < or not. To that 
aim, we define 

CDi = p X E P(0*^M>)P(i , jfc-/-i(K>,JC)o{ii/i ►y}), 

e-Q w,x,y: x~y 

and 

fc-2 

0)2 = P E E P(0^^)P(Pjfc-^-i(M/,x)o{u;t^y}), 

^=fc-l-r mix ^,Jc,y: x~y 

and similarly we define (II) i and (II) 2 . Our convention is that if k - 1 < r m ix, then (I) 2 =(II) 2 = 0. 
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It turns out that (I) i and (II) i contribute a negligible amount to 13.61 1. Indeed, when k- £ - 1 > T n 
we use Lemma l231 to bound 

CEIBCJfc — ^— 1)| 
P(iV,_i («/,*)) < 2m -. 

We use this estimate and the BK inequality to bound 

a)i<Cmp2- m £ E\dB(£)\[E\B(k-£-l)\) 2 . 

£<k-l-T mix 

We bound E\B{k-£- 1)| < E\B(k)\ < m~ 3 2 ml2 by our assumption on A; to get that 

E\dB{k-l)\ 

e<k-i-r m w e<k-i-T t 



(lh<Cm- 5 p £ E\dB(£)\<Cm- 5 p £ , •-.,/•/-, 

[1 + cm 6 ] K * 1 



£ rmx 



where the last inequality is due to our induction hypothesis. This yields the bound 

CDi < Cm~ 2 pE\dB(k- 1)| < Cm~ 3 E\dB{k-\)\, (3.8) 

since p = 0(m _1 ) and C > may depend on c. An almost identical calculation gives that 

(II)i<Cm~ 3 E|dB(Jfc-l)|. (3.9) 

Bounding (I) 2 and (11)2 is more delicate and the local structure of the hypercube comes into play. 
Let us start with bounding (II) 2 since it is slightly simpler. We start by bounding 

P(Pk-£-i(w, x) o Pk-e-iiw, y)) < m(m - ifk-2£-2 p zk-2e-2 p k-e-i,k-e-i {x> w> y) j 

where p fl,f2 {x, w, y) is the probability that a non-backtracking random walk starting from x visits w at 
time fi and visits y at time t\ + t 2 . The reason for this bound is that if the event on the left hand side 
occurs, then there exists a simple open path of length precisely 2k-2£-2 from x to y going through w 
at time k-£-\. The number of such paths is bounded above by m{m-l) 2k ~ 2( ~ 2 '^ k ~ e ~ 1 ' k ~ ( ~ l {x, w,y) 
and the estimate follows by the union bound. By transitivity we get that 

ai)2< (1+0( ^ 1))p2 E Emm £ m-i)p] 2k - 2£ - 2 P k - e - l *- £ -Hx,o,y). 

^=fc-l-T m jx x,y: d{x,y)-2 

Note that the sum over x, y in the right hand side does not depend on the 0, so by we may rewrite this 
sum as 

£ Km - 1) p] 2*-2*-2 p *-*-i,*-*-i (x> 0j y) 

x,y: d(x,y)=2 

= 2~ m £ ^[(m-i^-^-y-^^-W.y) 

x,y: d(x,y)=2 f 

= 2- w [(m-l)p] 2fc - 2 ^ 2 £ P 2fc " 2 '" 2 U,y) 

x,y: d(x,y)=2 

= £5^ZH [(m - 1) pj2fc-2.-2p2.-2.-2 (0> ei i) 

where in the last inequality we used the fact that on the hypercube, p f (x, y) is the same for any pair 
x,y such that d{x,y) = 2, and is the hypercube vector (1, 1,0,..., 0). We now use the induction 
hypothesis which implies that E| dB {£) | <E\dB{k- l)\ to get the bound of 

{\ + 0{m~ l ))p 2 m{m-l)E\dB{k-l)\ ^ x 9t 7t 

(H) 2 ^ — — — £ Km - l)p] 2t p 2 f (0, e u ) . 

4 r=i 
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We now appeal to Lemma [3^21 and use the fact that = O(mlogm) and that (m - l)p < 1 + 10/m 2 . 
A straightforward calculation with these gives that 

^ ? t ?t 2 + 0(m _1 ) 

£[(m-l)p] 2f p 2f (0,e u ) = — - (3.10) 
fr{ m(m-l) 



Thus, 



(II) 2 < yd + 0{m- l )W\dB{k- 1)| . (3.11) 



We proceed with bounding (1)2. We begin by estimating 

k-e-z k ~ e ~ 2 
V{P k -e- l {w,x)o{w < *y))< E ViPk-£-i(w,x)oP s {w,y)), 

and further bound, for each s, 

nP k -e-i(w,x)oP s (w,y)})<m(m-l) k - e - 1+s - 1 p k - e - 1+s p k - e - hs (x, w,y), 
where p fl,f2 [x, w, y) was defined earlier, and the reasoning for this bound is as before. We get 

k— 3 k—f—2 

(l) 2 <(l + 0(m- 1 ))p £ E|dfl(Q| E [im-l)p] k - e - 1+s E P*"' -1,, (x,0,y). (3.12) 

t=k-l-T mix s=0 X,y. x~y 

As before, the sum over x, y does not depend on 0, that is 

E p k - £ - 1 ' s (x,0,y) = 2- m Y E P k ~ £ ~ u (x,v,y)=2- m £ p fc -^ 1+s (x,y) = mp^" 1 ^,^), 

x,y: x~y y x,y: x~y x,y: x~y 

where ei is just the vector ( 1 , 0, . . . , 0) . As before we use the induction hypothesis to derive that E\dB(£)\< 
E|dB(Jfc-l)| to get that 

fc-3 k-e-2 

(I) 2 <(l + 0(m- 1 ))mpE|<9B(A;-l)| E E lim-l)p] k ~ i ~ 1+s P k ~ i ~ 1+ '(.0,ei). (3.13) 

£=k-l-T mix 5=0 

A straightforward manipulation with the double sum gives that 

2 Tmix 

{\) 2 <{\ + 0{m- l ))mpE\dB{k-\)\ E 2[f/2J[(m- l)p] f p f (0,<?i). 

f=3 

The dominant term here is t = 3. We appeal to Lemma I3~2l the fact that r m jx = O(mlogm) and that 
(m - l)p = 1 + 0(ra~ 2 ) to obtain that 

2r mix 2 + 0(m _1 ) 

E 2Lf/2J[(m-l)p] f p f ((Ui) = (3.14) 

^5 ra(ra-l) 

We get the bound 

(1)2 £ (1 + 0(ra -1 ))mpE|dB(A:- l)|[2m" 2 + 0(m" 3 )] . (3.15) 

Finally, we put this together with 113.81 1, H3.9H and H3.11I I into 113.611 to obtain 

E\dB{k)\ > [mp-(m-l)p 2 -2m" 1 p-p 2 /2-Cm" 3 ]E|a J B(A;-l)|. (3.16) 

Here we stress that since p < (1 + 10/m 2 )/(m - 1), the constants hidden in the O(-) in 113.1 OH -l IBTTBl are 
independent of the constant 6 from the definition of p, which implies that also C > may depend 
on c> but not on B. We plug in the value of p and a straightforward calculation shows that we can 
choose B > large enough such that 

-3 



mp- {m- l)p 2 -2m l p- p 2 12-Cm 3 > 1 + cm 



concluding our proof. 
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□ 

Proof of Proposition 13771 By LemmaO E\dB{k)\ > (1 + c/m 3 ) k as long as E|B(fc)| < 2 ml2 /m 3 . Hence, 
eventually E\B{ k) \ >2 m,2 /m 3 , proving the claim. □ 
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